Audio-Visual Self-Supervised Terrain Type Recognition for Ground Mobile Platforms

نویسندگان

چکیده

The ability to recognize and identify terrain characteristics is an essential function required for many autonomous ground robots such as social robots, assistive vehicles, exploration robots. Recognizing identifying challenging because similar terrains may have very different appearances (e.g., carpet comes in colors), while with appearance physical properties mulch versus dirt). In order address the inherent ambiguity vision-based recognition identification, we propose a multi-modal self-supervised learning technique that switches between audio features extracted from microphone attached underside of mobile platform image by camera on cluster types. labels are then used train image-based real-time CNN (Convolutional Neural Network) predict types changes. Through experiments, demonstrate proposed type method achieves over 80% accuracy, which greatly outperforms several baselines suggests strong potential applications.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Self-Supervised Visual Place Recognition Learning in Mobile Robots

Place recognition is a critical component in robot navigation that enables it to re-establish previously visited locations, and simultaneously use this information to correct the drift incurred in its dead-reckoned estimate. In this work, we develop a self-supervised approach to place recognition in robots. The task of visual loop-closure identification is cast as a metric learning problem, whe...

متن کامل

Pushing the envelope in deep visual recognition for mobile platforms

Image classification is the task of assigning to an input image a label from a fixed set of categories. It’s one of the core problems in computer vision that, despite its simplicity, has a large variety of practical applications. In a typical situation, a classification model takes as input a single image and assigns a single label or a probability distribution over the different labels as outp...

متن کامل

Self-Supervised Terrain Classification for Planetary Surface

Exploration Rovers 2 3 4 Christopher A. Brooks, Karl Iagnemma 5 Department of Mechanical Engineering 6 Massachusetts Institute of Technology 7 Cambridge, MA 02139 8 [email protected], [email protected] 9 10 Abstract 11 12 For future planetary exploration missions, improvements in autonomous rover 13 mobility have the potential to increase scientific data return by providing safe 14 access to geol...

متن کامل

Deep Texture Manifold for Ground Terrain Recognition

We present a texture network called Deep Encoding Pooling Network (DEP) for the task of ground terrain recognition. Recognition of ground terrain is an important task in establishing robot or vehicular control parameters, as well as for localization within an outdoor environment. The architecture of DEP integrates orderless texture details and local spatial information and the performance of DE...

متن کامل

Continuous Audio-visual Speech Recognition Continuous Audio-visual Speech Recognition

We address the problem of robust lip tracking, visual speech feature extraction, and sensor integration for audiovisual speech recognition applications. An appearance based model of the articulators, which represents linguistically important features, is learned from example images and is used to locate, track, and recover visual speech information. We tackle the problem of joint temporal model...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2021

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2021.3059620